Voice control system

ABSTRACT

A voice control system allows a user to control a device through voice commands. The voice control system includes a speech recognition unit that receives a control signal from a mobile device and a speech signal from a user. The speech recognition unit configures speech recognition settings in response to the control signal to improve speech recognition.

BACKGROUND OF THE INVENTION

1. Priority Claim

This application claims the benefit of priority from European Patent Application No. 07002342.9, filed Feb. 2, 2007, which is incorporated by reference.

2. Technical Field

The invention is directed to speech recognition and, more particularly, to voice control systems.

3. Related Art

Voice control systems allow users to communicate with computers. When engaging a voice control system, a user may speak a word or series of words. The voice control system may receive these spoken words and determine whether any actions are associated with the detected words. If the detected words represent a valid command, the voice control system may implement the associated command.

Voice control systems may provide a fast and efficient method of communication. However, this benefit may diminish as the quality of detected speech signals declines. To improve the quality of detected speech signals, some voice control systems may use multi-channel microphone arrays and/or beamforming techniques. These efforts may not be enough to improve the detected speech signals. Therefore, a need exists for a voice control system that provides a more reliable human-machine interaction.

SUMMARY

A voice control system allows a user to control a device through voice commands. The voice control system includes a speech recognition unit that receives a control signal from a mobile device and a speech signal from a user. The speech recognition unit configures speech recognition settings in response to the control signal to improve speech recognition.

Other systems, methods, features and advantages will be, or will become, apparent to one with skill in the art upon examination of the following figures and detailed description. It is intended that all such additional systems, methods, features and advantages be included within this description, be within the scope of the invention, and be protected by the following claims.

BRIEF DESCRIPTION OF THE DRAWINGS

The system may be better understood with reference to the following drawings and description. The components in the figures are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention. Moreover, in the figures, like referenced numerals designate corresponding parts throughout the different views.

FIG. 1 is a voice control system.

FIG. 2 is a method of operating a voice control system.

FIG. 3 is a method of activating speech recognition functionality in response to a received control signal.

FIG. 4 is a method of transmitting a control signal from a mobile device upon detecting a movement of the mobile device.

FIG. 5 is a method of determining an identity of a user based on a received control signal.

FIG. 6 is a method of determining a position of a user based on a received control signal.

FIG. 7 is a method of performing multiple actions in response to a received control signal.

FIG. 8 is a method of determining the acoustic properties of a room.

FIG. 9 is a method of transmitting detected speech signals from a mobile device to a speech recognition unit.

FIG. 10 is a method of transmitting a speech recognition result from a mobile device to a speech recognition unit.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

FIG. 1 is a diagram of a voice control system that comprises a speech recognition unit 102. The speech recognition unit 102 may be a stationary unit. In some applications, the speech recognition unit 102 may be installed in a room, such as a living room or an office. In other applications, the speech recognition unit 102 may be installed in a vehicle. The speech recognition unit 102 may be part of an audio system, a visual system, or other electronic communication or entertainment devices. The speech recognition unit 102 may be located in a housing of the system controlled by voice command. Alternatively, the speech recognition unit 102 may be separate from the system controlled by voice commands.

The voice control system may include one or more receivers 104. The receivers 104 may be radio receivers, infrared receivers, acoustic receivers, or optical receivers. In some systems, the receivers 104 may be Bluetooth receivers. Bluetooth is a short-distance wireless communication technique that is relatively low in both cost and power consumption. The range of Bluetooth signals may be around ten meters. Alternatively, the range of Bluetooth signals may be extended to greater distances. In some systems, Bluetooth technology makes use of the Industrial, Scientific, and Medical (“ISM”) band (about 2,402 to about 2,480 MHz) that is globally available and license-free. To achieve a relatively high interference resistance, Bluetooth technology may employ a fast frequency hopping method with 1600 frequency changes per second. As an alternative to Bluetooth technology, the receivers 104 may use one or more other communication protocols. In some systems, the receivers 104 may be Wireless Local Area Network (“WLAN”) receivers. In other systems, the receivers 104 may be infrared receivers, such as those receivers configured to receive infrared signals according to the standards of the Infrared Data Association (“IrDA”). In still other systems, the receivers 104 may be acoustic receivers, such as those receivers configured to receive high-frequency acoustic signals (e.g., acoustic signals above about 20 kHz). In still yet other systems, the receivers 104 may be optical receivers, such as those receivers configured to receive ultraviolet signals.

In FIG. 1, the speech recognition unit 102 communicates with two receivers 104, even though the system may communicate with more than two receivers 104 or a single receiver 104. The receivers 104 may determine a person's position. In systems that have multiple Bluetooth receivers, the Bluetooth receivers may perform Bluetooth-mediated locating. Specifically, the Bluetooth receivers may use Bluetooth signals to determine a place or location of a user.
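
For illustration only, the sketch below shows one way a position estimate might be derived from the signal strengths reported by multiple Bluetooth receivers 104; the RSSI-based path-loss model, the receiver coordinates, and the function names are assumptions rather than details of the system described here.

```python
# Hypothetical sketch of Bluetooth-mediated locating: estimate a user's position
# from the signal strengths seen at several receivers 104. The path-loss model,
# its constants, and the weighted-centroid approach are assumptions.
def rssi_to_distance(rssi_dbm, tx_power_dbm=-59.0, path_loss_exponent=2.0):
    """Rough range estimate (meters) from a received signal strength (dBm)."""
    return 10 ** ((tx_power_dbm - rssi_dbm) / (10.0 * path_loss_exponent))

def estimate_position(observations):
    """observations: list of ((x, y), rssi_dbm) pairs, one per receiver 104."""
    weights, xs, ys = [], [], []
    for (x, y), rssi in observations:
        w = 1.0 / max(rssi_to_distance(rssi), 0.1)  # nearer receivers weigh more
        weights.append(w)
        xs.append(w * x)
        ys.append(w * y)
    total = sum(weights)
    return (sum(xs) / total, sum(ys) / total)

# Example: the user appears closer to the receiver at the origin.
print(estimate_position([((0.0, 0.0), -55.0), ((4.0, 0.0), -70.0)]))
```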

The voice control system may include one or more speech detection devices. The speech detection devices may convert acoustic signals into analog or digital data. In some systems, the speech detection device may include a device that converts sound into digital or analog signals. In other systems, the speech detection device may include one or more microphones. In yet other systems, the speech detection device may comprise a device that extracts voice from ambient noise, such as a microphone array 106 of FIG. 1. One or more of the microphones in the microphone array 106 may be directional microphones. The speech detection device, such as the microphone array 106, may detect speech signals from the user. The microphone array 106 may transmit the detected speech signals to speech recognition functionality of the speech recognition unit 102. The speech recognition functionality may attempt to recognize the semantic content of the detected speech signals.

The voice control system may communicate with one or more mobile devices 108. The mobile devices 108 may be associated with users. Specifically, the mobile devices 108 may be carried by the users. The mobile devices 108 may communicate with the speech recognition unit 102. The mobile devices 108 may transmit one or more control signals to the receivers 104 of the speech recognition unit 102. The control signals may indicate a presence or location of the user to the speech recognition unit 102. The control signals may be non-audible signals that are independent of the semantic content of the speech signals spoken by the users. The control signals transmitted from the mobile devices 108 may be separate from the speech signals spoken by the users. The control signals may be infrared signals, radio signals, optical signals, and/or acoustic signals that may either be in or out of the audible range. The mobile devices 108 may be mobile telephones, smartphones, personal digital assistants (“PDAs”), remote controls, headsets, hearing aids, or other mobile devices that may transmit control signals.

When a user of the voice control system is wearing a binaural hearing aid, the radio signals created by the binaural hearing aid may be used as control signals in the voice control system. The binaural hearing aid may comprise a left hearing aid component and a right hearing aid component. During operation, one or more radio signals may be transmitted between the left and right hearing aid components. The speech recognition unit 102 may detect the one or more radio signals to determine a presence or location of the user in a vicinity of the speech recognition unit 102. Alternatively, the binaural hearing aid may include a second transmitter to transmit control signals to the speech recognition unit 102.

One or more users may be located in the vicinity of the speech recognition unit 102. Each of the users may carry a mobile device 108 to communicate with the speech recognition unit 102. A first user in the vicinity of the speech recognition unit 102 may be carrying a mobile device 108 that transmits a first control signal. A second user in the vicinity of the speech recognition unit 102 may carry a mobile device 108 that transmits a second control signal. The speech recognition unit 102 may distinguish the first user from the second user based on one, two, or more differentiating characteristics of the first and second control signals. In some systems, the mobile devices 108 may transmit different or varying control signals to the speech recognition unit 102. A modulator may vary the frequency or another characteristic of the control signal prior to transmission. In other systems, the first and second control signals may carry information, such as a resource identifier, that may differentiate the first user from another user or users.

FIG. 2 is a method of operating the voice control system. At act 202, a control signal is received. Specifically, the speech recognition unit 102 may receive a control signal transmitted from a mobile device 108. At act 204, an action is performed in response to the control signal. Specifically, the speech recognition unit 102 may configure one or more speech recognition settings in response to the control signal to assist in a recognition of future speech signals from the user. At act 206, a speech signal is received from the user. Specifically, the microphone array 106, or other speech detection device, may detect a speech signal from the user. At act 208, the speech signal is analyzed to recognize semantic content of the speech signal. Specifically, the speech recognition unit 102 may comprise speech recognition functionality that recognizes the content of the speech signal from the user. At act 210, a voice-controlled operation is performed based on the content of the recognized speech signal. In one instance, the user may have uttered the keyword “louder.” In response to the recognition of this utterance, voice control functionality of the speech recognition unit 102 may increase the volume of an audio system operated by voice control.
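
A minimal sketch of this control flow follows; the class structure, method names, settings dictionary, and the single “louder” command are illustrative assumptions, not the actual implementation of the speech recognition unit 102.

```python
# Minimal sketch of the FIG. 2 flow (acts 202-210). All names, the settings
# dictionary, and the placeholder recognizer are assumptions for illustration.
class AudioSystem:
    def __init__(self):
        self.volume = 5

    def increase_volume(self):
        self.volume += 1


class SpeechRecognitionUnit:
    def __init__(self, audio_system):
        self.audio_system = audio_system
        self.settings = {}

    def on_control_signal(self, control_signal):
        # Acts 202/204: receive the control signal and configure settings.
        self.settings.update(control_signal.get("recognition_settings", {}))

    def on_speech_signal(self, speech_signal):
        # Acts 206/208: receive the speech signal and recognize its content.
        command = self.recognize(speech_signal)
        # Act 210: perform a voice-controlled operation.
        if command == "louder":
            self.audio_system.increase_volume()

    def recognize(self, speech_signal):
        # Placeholder recognizer; a real unit would decode the audio here.
        return speech_signal.strip().lower()


unit = SpeechRecognitionUnit(AudioSystem())
unit.on_control_signal({"recognition_settings": {"language": "en-US"}})
unit.on_speech_signal("Louder")
print(unit.audio_system.volume)  # 6
```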

FIG. 3 is a method of activating speech recognition functionality in response to a received control signal. At act 302, a mobile device 108 may be moved. Specifically, a user may carry the mobile device 108 into a room where the speech recognition unit 102 is located. At act 304, a control signal is transmitted. Specifically, the mobile device 108 transmits the control signal. In some systems, the mobile device 108 may transmit a control signal repeatedly. In these systems, the mobile device 108 may transmit the signal on a synchronous or asynchronous schedule automatically without a user action. In other systems, the mobile device 108 transmits a control signal in response to a user's activity. In these systems, the user may activate a graphical or mechanical element of the mobile device 108 to transmit the control signal to the speech recognition unit 102.

At act 306, the control signal is received and processed. At act 308, speech recognition functionality is activated. Activating the speech recognition functionality in response to a received control signal allows the speech recognition functionality to detect and recognize a first utterance of a user. Where the control signal is transmitted upon a user's activity, the user may control when the speech recognition functionality is activated. The user may delay activation of the speech recognition functionality until the user is ready to interact with the voice control system. When the user enters a room in proximity to the speech recognition unit 102, the user may not want certain utterances to be detected by the speech recognition unit 102.

FIG. 4 is a method of transmitting a control signal from a mobile device 108 upon detecting a movement of the mobile device 108. As an alternative to periodically transmitting control signals or only transmitting a control signal upon demand of the user, the mobile device 108 may transmit a control signal upon detection of a movement of the mobile device 108. At act 402, a movement of the mobile device 108 is detected. Specifically, the mobile device 108 may comprise a motion sensor to detect movement of the mobile device 108. The motion sensor may be an acceleration sensor. At act 404, an indication of the occurrence of the detected movement is sent to the control signal transmission functionality of the mobile device 108. At act 406, the control signal is transmitted. Specifically, the mobile device 108 transmits the control signal in response to the detected movement of the mobile device 108. In some systems, sending control signals only upon a detected movement of the mobile device 108 may conserve more power than periodically transmitting control signals.
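
A short sketch of this motion-triggered behavior is shown below; the polling loop, the threshold value, and the stub functions for the sensor and transmitter are assumptions for illustration.

```python
import time

# Sketch of the FIG. 4 behavior (acts 402-406). The threshold, the polling
# interval, and the sensor/transmitter stubs are assumed for illustration.
MOVEMENT_THRESHOLD = 1.5  # change in acceleration (m/s^2) treated as movement

def read_acceleration():
    """Stub for the acceleration sensor of the mobile device 108."""
    return (0.0, 0.0, 9.81)

def transmit_control_signal():
    """Stub for the control signal transmission functionality (act 406)."""
    print("control signal transmitted")

def monitor_motion(iterations=100, poll_interval_s=0.2):
    last = read_acceleration()
    for _ in range(iterations):
        current = read_acceleration()
        delta = sum(abs(c - l) for c, l in zip(current, last))
        if delta > MOVEMENT_THRESHOLD:   # act 402: movement detected
            transmit_control_signal()    # acts 404/406
        last = current
        time.sleep(poll_interval_s)
```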

FIG. 5 is a method of determining an identity of a user based on a received control signal. At act 502, a control signal is received and processed. The speech recognition unit 102 may receive the control signal transmitted from a mobile device 108. At act 504, the identity of the user associated with the mobile device 108 is determined. The speech recognition unit 102 may process user information from the control signal to determine the identity of the user associated with the mobile device 108. In some systems, the identity of the user is determined by processing information transmitted in a control signal from the mobile device 108. In other systems, the identity of the user is determined by processing a characteristic of the control signal itself. Specifically, the control signals may be adjusted, regulated or changed (e.g., in frequency, amplitude, or phase).

At act 506, one or more speech recognition settings are configured based on the determined identity of the user. Specifically, the speech recognition process may adjust one or more of the speech recognition settings based on the identity of the user to improve the reliability of the speech recognition results. In some processes, the speech recognition process may select the language of vocabulary lists used for speech recognition based on the determined identity of the user. In other processes, the speech recognition process may use an individually trained code book based on the determined identity of the user. In yet other processes, the user may have previously configured other personalized voice recognition features. Once the speech recognition process determines the identity of the user from the control signal, the process may activate these personalized features.
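
The sketch below illustrates acts 504 and 506 as a simple lookup; the profile fields, identifiers, and function names are assumptions for illustration only.

```python
# Sketch of acts 504-506: map a user identity carried in the control signal to
# personalized recognition settings. Profiles and field names are assumptions.
USER_PROFILES = {
    "user-a": {"language": "en-US", "codebook": "user_a_codebook.bin"},
    "user-b": {"language": "de-DE", "codebook": "user_b_codebook.bin"},
}
DEFAULT_SETTINGS = {"language": "en-US", "codebook": "generic_codebook.bin"}

def identify_user(control_signal):
    """Act 504: read an identifier assumed to be carried in the control signal."""
    return control_signal.get("user_id")

def configure_recognition_settings(control_signal):
    """Act 506: select settings for the determined identity, or fall back."""
    return USER_PROFILES.get(identify_user(control_signal), DEFAULT_SETTINGS)

print(configure_recognition_settings({"user_id": "user-b"}))
```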

When individual users are identified through control signals received from mobile devices 108, a prioritization may occur. In this situation, a verbal command given by one of several users present in a room may be overruled by another command given by another speaker with a higher priority. The speech recognition unit 102 or process may also adjust an input, such as the microphone array 106 of FIG. 1, based on the priority of each user. Specifically, the recognition unit 102 or process may adjust the microphone array 106 to process the speech signals of higher-priority users.

FIG. 6 is a method of determining a position of a user based on a received control signal. At act 602, a control signal is received and processed. Specifically, the speech recognition process may receive a control signal transmitted from a mobile source. At act 604, the position of the user associated with the mobile source is identified. The speech recognition process may use information of the control signal to determine the position of the user associated with the mobile source. The speech recognition process may use transmission time measurements to determine the location of the user. Where the mobile source transmits the control signal periodically or repeatedly, the speech recognition process may track a position of the user as the user moves.

At act 606, one or more speech recognition settings are configured based on the determined position of the user. Specifically, the speech recognition process may adjust one or more of the speech recognition settings based on the position of the user. Adaptation of parameters for speech recognition and adjustment of the input or input array may occur dynamically over time. In some processes, the speech detection process adjusts the input or input array based on the detected position of the user. The speech detection process may adjust the input array based on the determined position of the user to aim a directional input of the input array towards the position of the user.

The input or microphone array 106 may be adjusted according to the determined position/direction of the user in order to beamform the speech signals detected by the multiple inputs or microphones of the input or microphone array 106. The signal-to-noise ratio (“SNR”) of a beamformed signal may exceed the SNR of non-beamformed signals. The performance of beamforming devices, however, may depend on knowing the position of the speaker relative to the input or microphone array 106. Knowing the position of the speaker allows the beamformer to set a preferential direction for detection of a wanted signal. Therefore, determining the position of the user based on the received control signals may improve the beamforming effort and thus the speech recognition result.
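
As one common illustration of such beamforming, the sketch below steers a delay-and-sum beamformer toward a known user direction; the linear array geometry, sampling rate, and function names are assumptions, not parameters of the system described above.

```python
import numpy as np

# Delay-and-sum beamformer sketch: a known user direction sets the preferential
# look direction of the array. Array geometry, sample rate, and names are
# assumptions used only for illustration.
SPEED_OF_SOUND = 343.0                                # m/s
SAMPLE_RATE = 16000                                   # Hz
MIC_POSITIONS = np.array([0.00, 0.05, 0.10, 0.15])    # linear array (meters)

def steering_delays(angle_rad):
    """Per-microphone delays (seconds) for a plane wave from the given angle."""
    return MIC_POSITIONS * np.sin(angle_rad) / SPEED_OF_SOUND

def delay_and_sum(channels, angle_rad):
    """channels: array of shape (num_mics, num_samples); returns one channel."""
    shifts = np.round(steering_delays(angle_rad) * SAMPLE_RATE).astype(int)
    out = np.zeros(channels.shape[1])
    for channel, shift in zip(channels, shifts):
        out += np.roll(channel, -shift)  # align each microphone to the look direction
    return out / len(channels)

# Example: steer toward a user located 30 degrees off broadside.
signals = np.random.randn(4, SAMPLE_RATE)  # stand-in for one second of audio
enhanced = delay_and_sum(signals, np.deg2rad(30.0))
```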

FIG. 7 is a method of performing multiple actions in response to a received control signal. At act 702, a control signal is received and processed. Specifically, the speech recognition unit 102 may receive a control signal transmitted from a mobile device 108. At act 704, speech recognition functionality of the speech recognition unit 102 is activated. At acts 706 and 708, the one or more speech recognition settings are configured based on the determined identity of the user. At acts 710 and 712, the speech detection device is adjusted based on the determined position of the user.

FIG. 8 is a method of determining the acoustic properties of a room. At act 802, a radio signal and an audio signal are transmitted. The audio signal may be within the audible frequency range and the radio signal may serve as the control signal. The mobile device 108 transmits the radio signal and the audio signal to the speech recognition unit 102. At act 804, the radio signal and the audio signal are processed by the speech recognition unit 102. Specifically, the audio signal may be detected by the microphone array 106 and the radio signal may be detected by the receiver 104.

At act 806, the position of the user may be determined. Specifically, the speech recognition unit 102 may determine the distance to the user carrying the mobile device 108 by measuring the difference in transmission time of the simultaneously transmitted audio and radio signals. At act 808, one or more speech recognition settings are configured based on the determined position of the user. Specifically, the microphone array 106 may be adjusted in accordance with the determined distance between the user and the microphone array 106. One or more parameters for signal processing may also be adjusted. Also, beamforming and signal filtering may be controlled and adapted based on the determined distance and direction from which the audio and radio signals are detected.
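
Because the radio signal propagates at essentially the speed of light, the arrival-time gap between the two signals is dominated by the acoustic travel time; the sketch below illustrates this, with the constant and function name being assumptions.

```python
SPEED_OF_SOUND = 343.0  # m/s at room temperature (assumed value)

def distance_from_arrival_gap(radio_arrival_s, audio_arrival_s):
    """Act 806 sketch: the radio propagation delay is negligible, so the gap
    between the radio and audio arrival times approximates the acoustic travel
    time, which maps directly to the distance to the mobile device 108."""
    gap = audio_arrival_s - radio_arrival_s
    return max(gap, 0.0) * SPEED_OF_SOUND

# Example: an audio signal arriving 8.7 ms after the radio signal corresponds
# to a distance of roughly 3 meters.
print(round(distance_from_arrival_gap(0.0, 0.0087), 2))
```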

At act 810, the received audio signal is analyzed to determine the acoustic properties of the room. Specifically, the speech recognition unit 102 may estimate the impulse response of the loudspeaker-room-microphone system. The loudspeaker of this system may be part of the mobile device 108 that transmitted the audio signal. The audio signal transmitted by the mobile device 108 may be known by the speech recognition unit 102. The audio signal may represent a jingle that indicates to the user that voice control is activated. The signal characteristics of the audio signal (e.g., the frequency, the short-time spectrum, the power density, etc.) are stored in the speech recognition unit 102 and compared with the detected and processed audio signal transmitted by the mobile device 108 for analysis of the acoustic characteristics of the room. Based on this comparison, the reverberation characteristics of the room and the impulse response of the loudspeaker-room-microphone system may be estimated.
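
One way such a comparison might be carried out is frequency-domain deconvolution of the recorded jingle against the stored reference, sketched below; the regularized spectral division, the toy example, and the function name are assumptions rather than the method prescribed above.

```python
import numpy as np

# Sketch of act 810: estimate the loudspeaker-room-microphone impulse response
# by deconvolving the recorded jingle with the stored reference copy. The
# regularization constant and the toy example are assumptions.
def estimate_impulse_response(reference, recorded, eps=1e-8):
    """reference: stored jingle samples; recorded: the jingle as detected by
    the microphone array 106. Returns an estimated impulse response."""
    n = len(reference) + len(recorded) - 1
    ref_spec = np.fft.rfft(reference, n)
    rec_spec = np.fft.rfft(recorded, n)
    # H = Y / X with regularization to guard against near-zero spectral bins.
    ir_spec = rec_spec * np.conj(ref_spec) / (np.abs(ref_spec) ** 2 + eps)
    return np.fft.irfft(ir_spec, n)

# Toy example: a recording that is the jingle convolved with a single delayed
# reflection yields an impulse response peaking at that delay.
jingle = np.random.randn(4000)
room = np.zeros(800)
room[120] = 0.6
recording = np.convolve(jingle, room)
ir = estimate_impulse_response(jingle, recording)
print(int(np.argmax(np.abs(ir))))  # approximately 120
```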

At act 812, one or more speech recognition settings are configured based on the determined acoustic properties of the room. Based on the acoustic properties of the room, the speech recognition unit 102 may adapt one or more processing parameters used for speech recognition. In some systems, the speech recognition unit 102 may adjust noise reduction filters and/or echo compensation filters based on the determined acoustic characteristics of the room. Some systems use these filters before speech recognition to enhance the quality of the detected speech signals. Adjusting the filters based on the determined acoustic characteristics of the room may improve the reliability of speech recognition at the speech recognition unit 102.

FIG. 9 is a method of transmitting detected speech signals from a mobile device 108 to a speech recognition unit 102. At act 902, the mobile device 108 (e.g., a mobile telephone) detects the speech signal from the user. The input of the mobile device 108 used for speech detection may be relatively close to the mouth of the user. Due to the proximity of the input to the user, the input of the mobile device 108 may not be significantly affected by a noisy environment or other acoustic properties of the room. Therefore, in some systems or environments, the speech signal detected by the input of the mobile device 108 may be more reliable than a speech signal detected at the speech recognition unit 102.

At act 904, the mobile device 108 transmits the detected speech signal to the speech recognition unit 102. In some systems, the detected speech signal is transmitted as part of a control signal sent between the mobile device 108 and the speech recognition unit 102. In other systems, the detected speech signal is transmitted separately from the control signals sent between the mobile device 108 and the speech recognition unit 102.

At act 906, the detected speech signal transmitted from the mobile device 108 is received at the speech recognition unit 102. At act 908, the speech recognition unit 102 attempts to determine a speech recognition result. In some systems, the speech recognition unit 102 uses the speech signal detected at the mobile device 108 to determine the speech recognition result. In other systems, the speech recognition unit 102 uses both the speech signal detected at the mobile device 108 and a speech signal detected by a local device (e.g., a signal detected by the microphone array 106) to determine the speech recognition result. In still other systems, only a portion of a speech signal transmitted from the mobile device 108 to the speech recognition unit 102 may be used for speech recognition at the speech recognition unit 102. Because the sampling rate of standard mobile telephones may be about 8 kHz, speech recognition of sibilants transmitted to the speech recognition unit 102 may not be sufficiently reliable in some systems. Therefore, some systems may use the speech recognition unit 102 to detect sibilant portions of the speech signal. The speech recognition unit 102 may then use both locally detected portions of the speech signal (e.g., sibilant portions) and portions detected at the mobile device 108 (e.g., non-sibilant portions) to determine a speech recognition result.
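
A crude sketch of such a combination follows: frames classified as sibilant are taken from the wideband local array, other frames from the narrowband mobile signal. The high-frequency energy heuristic, frame length, sampling rate, and function names are assumptions, not details from the description above.

```python
import numpy as np

# Sketch of combining a wideband local signal (microphone array 106) with a
# narrowband signal from the mobile device 108. The sibilant heuristic, frame
# size, and sampling rate are assumptions for illustration.
SAMPLE_RATE = 16000
FRAME = 320  # 20 ms frames

def is_sibilant(frame):
    """Crude heuristic: sibilants concentrate energy above roughly 4 kHz."""
    spectrum = np.abs(np.fft.rfft(frame)) ** 2
    cutoff_bin = int(4000 / (SAMPLE_RATE / 2) * (len(spectrum) - 1))
    return spectrum[cutoff_bin:].sum() > 0.5 * spectrum.sum()

def combine(local_signal, mobile_signal):
    """Pick, frame by frame, the source expected to be more reliable."""
    length = min(len(local_signal), len(mobile_signal))
    out = np.array(mobile_signal[:length], dtype=float)
    for start in range(0, length - FRAME + 1, FRAME):
        frame = local_signal[start:start + FRAME]
        if is_sibilant(frame):
            out[start:start + FRAME] = frame
    return out
```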

FIG. 10 is a method of transmitting a speech recognition result from a mobile device 108 to the speech recognition unit 102. At act 1002, the mobile device 108 detects the speech signal from the user. At act 1004, the semantic content of the detected speech signal is recognized. Specifically, the mobile device 108 includes a speech recognizer that recognizes the content of the detected speech signal.

At act 1006, the mobile device 108 transmits the speech recognition result to the speech recognition unit 102. In some systems, the speech recognition result is transmitted as part of a control signal sent between the mobile device 108 and the speech recognition unit 102. In other systems, the speech recognition result is transmitted separately from the control signals sent between the mobile device 108 and the speech recognition unit 102. Specifically, the mobile device 108 may transmit a Bluetooth signal to convey the speech recognition result to the speech recognition unit 102.

At act 1008, the speech recognition unit 102 receives the speech recognition result from the mobile device 108 and performs a voice control action in response. In one instance, the user may utter the phrase “voice control on.” The speech recognition functionality of the mobile device 108 may recognize the phrase and form a Bluetooth signal to be sent to the speech recognition unit 102. The speech recognition unit 102 may then activate speech recognition functionality in response. In some systems, the speech recognition signals from the mobile device 108 may be given priority over a speech signal detected and recognized at the stationary unit 102.
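
The exchange can be pictured as in the sketch below; the message format, the placeholder recognizer, and the prioritization rule are assumptions for illustration only.

```python
# Sketch of the FIG. 10 exchange (acts 1002-1008): the mobile device 108
# recognizes the phrase locally and sends only the result, which the unit 102
# prefers over its own local result. All names and the message format are
# assumptions for illustration.
def mobile_recognize_and_send(speech_signal):
    """Acts 1002-1006: local recognition, then transmit the result."""
    result = speech_signal.strip().lower()  # placeholder recognizer
    return {"type": "recognition_result", "text": result}

def unit_handle_message(message, local_result=None):
    """Act 1008: act on the mobile result, giving it priority over a local one."""
    text = message.get("text") if message else local_result
    if text == "voice control on":
        return "activate speech recognition functionality"
    return "no action"

message = mobile_recognize_and_send("  Voice Control ON ")
print(unit_handle_message(message, local_result="unclear utterance"))
```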

Each of the processes described may be encoded in a computer readable medium such as a memory, programmed within a device such as one or more integrated circuits, one or more processors or may be processed by a controller or a computer. If the processes are performed by software, the software may reside in a memory resident to or interfaced to a storage device, a communication interface, or non-volatile or volatile memory in communication with a transmitter. The memory may include an ordered listing of executable instructions for implementing logic. Logic or any system element described may be implemented through optic circuitry, digital circuitry, through source code, through analog circuitry, or through an analog source, such as through an electrical, audio, or video signal. The software may be embodied in any computer-readable or signal-bearing medium, for use by, or in connection with an instruction executable system, apparatus, or device. Such a system may include a computer-based system, a processor-containing system, or another system that may selectively fetch instructions from an instruction executable system, apparatus, or device that may also execute instructions.

A “computer-readable medium,” “machine-readable medium,” “propagated-signal” medium, and/or “signal-bearing medium” may comprise any device that contains, stores, communicates, propagates, or transports software for use by or in connection with an instruction executable system, apparatus, or device. The machine-readable medium may selectively be, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, device, or propagation medium. A non-exhaustive list of examples of a machine-readable medium would include: an electrical connection having one or more wires, a portable magnetic or optical disk, a volatile memory such as a Random Access Memory “RAM,” a Read-Only Memory “ROM,” an Erasable Programmable Read-Only Memory (EPROM or Flash memory), or an optical fiber. A machine-readable medium may also include a tangible medium upon which software is printed, as the software may be electronically stored as an image or in another format (e.g., through an optical scan), then compiled, and/or interpreted or otherwise processed. The processed medium may then be stored in a computer and/or machine memory.

Although selected aspects, features, or components of the implementations are described as being stored in memories, all or part of the systems, including processes and/or instructions for performing processes, consistent with the system may be stored on, distributed across, or read from other machine-readable media, for example, secondary storage devices such as hard disks, floppy disks, and CD-ROMs; a signal received from a network; or other forms of ROM or RAM resident to a processor or a controller.

Specific components of a system may include additional or different components. A controller may be implemented as a microprocessor, microcontroller, application specific integrated circuit (ASIC), discrete logic, or a combination of other types of circuits or logic. Similarly, memories may be DRAM, SRAM, Flash, or other types of memory. Parameters (e.g., conditions), databases, and other data structures may be separately stored and managed, may be incorporated into a single memory or database, or may be logically and physically organized in many different ways. Programs and instruction sets may be parts of a single program, separate programs, or distributed across several memories and processors.

While various embodiments of the invention have been described, it will be apparent to those of ordinary skill in the art that many more embodiments and implementations are possible within the scope of the invention. Accordingly, the invention is not to be restricted except in light of the attached claims and their equivalents.

CLAIMS

1. A system, comprising: a mobile device configured to transmit a control signal; a speech recognition unit configured to receive the control signal from the mobile device and a speech signal from a user associated with the mobile device; and where the speech recognition unit configures one or more speech recognition settings in response to the control signal to assist in a recognition of the speech signal.
2. The system of claim 1, where the speech recognition unit comprises a stationary unit, and where the mobile device is configured to transmit the control signal to indicate a position of the user to the speech recognition unit.
3. The system of claim 1, where the control signal comprises one of a plurality of control signals periodically transmitted by the mobile device.
4. The system of claim 1, where the mobile device is configured to transmit the control signal upon a user's demand.
5. The system of claim 1, where the mobile device comprises a motion sensor, and where the mobile device is configured to transmit the control signal upon a movement detected by the motion sensor.
6. The system of claim 1, where the control signal comprises an infrared signal, a radio signal, an optical signal, or an acoustic signal.
7. The system of claim 1, where the control signal comprises a non-audible signal independent of semantic content of the speech signal.
8. The system of claim 1, where upon receipt of the control signal, the speech recognition unit activates speech recognition functionality to recognize the speech signal from the user.
9. The system of claim 1, where the speech recognition unit is configured to process the control signal to determine an identity of the user.
10. The system of claim 9, where the speech recognition unit configures one or more speech recognition settings based on the identity of the user.
11. The system of claim 1, where the speech recognition unit is configured to process the control signal to determine a position of the user, where the speech recognition unit comprises a speech detection device, and where the speech recognition unit is configured to adjust the speech detection device based on the determined position of the user.
12. The system of claim 11, where the speech detection device comprises a microphone array that includes a directional microphone, and where the speech recognition unit is configured to aim the directional microphone towards the position of the user.
13. The system of claim 1, where the mobile device comprises a mobile phone.
14. The system of claim 1, where the mobile device comprises a binaural hearing aid worn by the user that transmits one or more radio signals between left and right hearing aid components, and where the speech recognition unit is configured to detect the one or more radio signals to determine a presence or position of the user relative to the speech recognition unit.
15. The system of claim 1, where the mobile device is configured to transmit the control signal and an audio signal, and where the speech recognition unit is configured to receive the audio signal and determine one or more acoustic characteristics of a room in which the speech recognition unit is located based on one or more characteristics of the audio signal received by the speech recognition unit.
16. The system of claim 15, where the speech recognition unit is configured to adjust one or more speech recognition settings based on the one or more acoustic characteristics of the room.
17. The system of claim 1, where the mobile device is configured to detect the speech signal from the user, and where the mobile device is configured to send information related to semantic content of the speech signal to the speech recognition unit.
18. The system of claim 17, where the speech recognition unit comprises a speech detection device that receives the speech signal from the user, and where the speech recognition unit is configured to use the speech detection device and the information related to the semantic content of the speech signal received from the mobile device to recognize the speech signal.
19. The system of claim 1, where the user comprises a first user, where the mobile device comprises a first mobile device, where the control signal comprises a first control signal; where the system further comprises a second mobile device configured to transmit a second control signal to indicate a presence or position of a second user relative to the speech recognition unit; and where the speech recognition unit distinguishes the first user from the second user based on one or more differentiating characteristics of the first and second control signals.
20. A system, comprising: a receiver configured to receive a control signal from a mobile device associated with a user; a speech detection device configured to receive a speech signal from the user; means for analyzing the control signal to determine a position of the user; means for adjusting the speech detection device based on the position of the user relative to the speech detection device; and means for recognizing the speech signal.
21. The system of claim 20, where the speech detection device comprises a microphone array that includes a directional microphone, and where the means for adjusting comprises means for adjusting the microphone array to aim the directional microphone towards the position of the user.
22. A method, comprising: receiving a control signal from a mobile device associated with a user; analyzing the control signal to determine a position of the user; and adjusting a speech detection device according to the position of the user to assist in a recognition of a speech signal from the user.
23. The method of claim 22, where the speech detection device comprises a microphone array that includes a directional microphone, and where the act of adjusting comprises adjusting the microphone array to aim the directional microphone towards the position of the user.
24. The method of claim 22, further comprising activating speech recognition functionality upon receipt of the control signal from the mobile device.
25. The method of claim 22, further comprising: analyzing the control signal to determine an identity of the user; and configuring one or more speech recognition settings based on the identity of the user.